NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A probabilistic modeling framework for genomic networks incorporating sample heterogeneity

https://doi.org/10.1016/j.crmeth.2025.100984

Chen, Liying; Acharyya, Satwik; Luo, Chunyu; Ni, Yang; Baladandayuthapani, Veerabhadran (February 2025, Cell Reports Methods)

Free, publicly-accessible full text available February 1, 2026
Robust Bayesian graphical regression models for assessing tumor heterogeneity in proteomic networks

https://doi.org/10.1093/biomtc/ujae160

Yao, Tsung-Hung; Ni, Yang; Bhadra, Anindya; Kang, Jian; Baladandayuthapani, Veerabhadran (January 2025, Biometrics)

ABSTRACT Graphical models are powerful tools to investigate complex dependency structures in high-throughput datasets. However, most existing graphical models make one of two canonical assumptions: (i) a homogeneous graph with a common network for all subjects or (ii) an assumption of normality, especially in the context of Gaussian graphical models. Both assumptions are restrictive and can fail to hold in certain applications such as proteomic networks in cancer. To this end, we propose an approach termed robust Bayesian graphical regression (rBGR) to estimate heterogeneous graphs for non-normally distributed data. rBGR is a flexible framework that accommodates non-normality through random marginal transformations and constructs covariate-dependent graphs to accommodate heterogeneity through graphical regression techniques. We formulate a new characterization of edge dependencies in such models called conditional sign independence with covariates, along with an efficient posterior sampling algorithm. In simulation studies, we demonstrate that rBGR outperforms existing graphical regression models for data generated under various levels of non-normality in both edge and covariate selection. We use rBGR to assess proteomic networks in lung and ovarian cancers to systematically investigate the effects of immunogenic heterogeneity within tumors. Our analyses reveal several important protein–protein interactions that are differentially associated with the immune cell abundance; some corroborate existing biological knowledge, whereas others are novel findings.
more » « less
Full Text Available
Tumor radiogenomics in gliomas with Bayesian layered variable selection

https://doi.org/10.1016/j.media.2023.102964

Mohammed, Shariq; Kurtek, Sebastian; Bharath, Karthik; Rao, Arvind; Baladandayuthapani, Veerabhadran (December 2023, Medical Image Analysis)

Full Text Available
Bayesian Covariate-Dependent Gaussian Graphical Models with Varying Structure

Ni, Yang; Stingo, Francesco; Baladandayuthapani, Veerabhadran (January 2022, Journal of machine learning research)

Full Text Available
Bayesian Hierarchical Quantile Regression with Application to Characterizing the Immune Architecture of Lung Cancer

https://doi.org/10.1111/biom.13774

Das, Priyam; Peterson, Christine_B; Ni, Yang; Reuben, Alexandre; Zhang, Jiexin; Zhang, Jianjun; Do, Kim-Anh; Baladandayuthapani, Veerabhadran (October 2022, Biometrics)

Abstract The successful development and implementation of precision immuno-oncology therapies requires a deeper understanding of the immune architecture at a patient level. T-cell receptor (TCR) repertoire sequencing is a relatively new technology that enables monitoring of T-cells, a subset of immune cells that play a central role in modulating immune response. These immunologic relationships are complex and are governed by various distributional aspects of an individual patient's tumor profile. We propose Bayesian QUANTIle regression for hierarchical COvariates (QUANTICO) that allows simultaneous modeling of hierarchical relationships between multilevel covariates, conducts explicit variable selection, estimates quantile and patient-specific coefficient effects, to induce individualized inference. We show QUANTICO outperforms existing approaches in multiple simulation scenarios. We demonstrate the utility of QUANTICO to investigate the effect of TCR variables on immune response in a cohort of lung cancer patients. At population level, our analyses reveal the mechanistic role of T-cell proportion on the immune cell abundance, with tumor mutation burden as an important factor modulating this relationship. At a patient level, we find several outlier patients based on their quantile-specific coefficient functions, who have higher mutational rates and different smoking history.
more » « less
RADIOHEAD: Radiogenomic analysis incorporating tumor heterogeneity in imaging through densities

https://doi.org/10.1214/21-AOAS1458

Mohammed, Shariq; Bharath, Karthik; Kurtek, Sebastian; Rao, Arvind; Baladandayuthapani, Veerabhadran (December 2021, The Annals of Applied Statistics)

Full Text Available
Bayesian graphical models for modern biological applications

https://doi.org/10.1007/s10260-021-00572-8

Ni, Yang; Baladandayuthapani, Veerabhadran; Vannucci, Marina; Stingo, Francesco C. (May 2021, Statistical Methods & Applications)

Abstract Graphical models are powerful tools that are regularly used to investigate complex dependence structures in high-throughput biomedical datasets. They allow for holistic, systems-level view of the various biological processes, for intuitive and rigorous understanding and interpretations. In the context of large networks, Bayesian approaches are particularly suitable because it encourages sparsity of the graphs, incorporate prior information, and most importantly account for uncertainty in the graph structure. These features are particularly important in applications with limited sample size, including genomics and imaging studies. In this paper, we review several recently developed techniques for the analysis of large networks under non-standard settings, including but not limited to, multiple graphs for data observed from multiple related subgroups, graphical regression approaches used for the analysis of networks that change with covariates, and other complex sampling and structural settings. We also illustrate the practical utility of some of these methods using examples in cancer genomics and neuroimaging.
more » « less
Bayesian data integration and variable selection for pan‐cancer survival prediction using protein expression data

https://doi.org/10.1111/biom.13132

Maity, Arnab Kumar; Bhattacharya, Anirban; Mallick, Bani K.; Baladandayuthapani, Veerabhadran (November 2019, Biometrics)

Full Text Available
Radiologic Image-Based Statistical Shape Analysis of Brain Tumours

https://doi.org/10.1111/rssc.12272

Bharath, Karthik; Kurtek, Sebastian; Rao, Arvind; Baladandayuthapani, Veerabhadran (March 2018, Journal of the Royal Statistical Society Series C: Applied Statistics)

Summary We propose a curve-based Riemannian geometric approach for general shape-based statistical analyses of tumours obtained from radiologic images. A key component of the framework is a suitable metric that enables comparisons of tumour shapes, provides tools for computing descriptive statistics and implementing principal component analysis on the space of tumour shapes and allows for a rich class of continuous deformations of a tumour shape. The utility of the framework is illustrated through specific statistical tasks on a data set of radiologic images of patients diagnosed with glioblastoma multiforme, a malignant brain tumour with poor prognosis. In particular, our analysis discovers two patient clusters with very different survival, subtype and genomic characteristics. Furthermore, it is demonstrated that adding tumour shape information to survival models containing clinical and genomic variables results in a significant increase in predictive power.
more » « less
Predictions, Role of Interventions and Effects of a Historic National Lockdown in India's Response to the the COVID-19 Pandemic: Data Science Call to Arms

https://doi.org/10.1162/99608f92.60e08ed5

Ghosh, Parikshit; Baladandayuthapani, Veerabhadran; Banerjee, Mousumi; Mukherjee, Bhramar; Ray, Debashree; Salvatore, Maxwell; Bhattacharyya, Rupam; Wang, Lili; Du, Jiacong; Mohammed, Shariq; et al (June 2020, Harvard Data Science Review)

With only 536 COVID-19 cases and 11 fatalities, India took the historic decision of a 21-day national lockdown on March 25, 2020. The lockdown was first extended to May 3 soon after the analysis of this article was completed, and then to May 18 while this article was being revised. In this article, we use a Bayesian extension of the susceptible-infected-removed (eSIR) model designed for intervention forecasting to study the short- and long-term impact of an initial 21-day lockdown on the total number of COVID-19 infections in India compared to other, less severe nonpharmaceutical interventions. We compare effects of hypothetical durations of lockdown on reducing the number of active and new infections. We find that the lockdown, if implemented correctly, can reduce the total number of cases in the short term, and buy India invaluable time to prepare its health care and disease-monitoring system. Our analysis shows we need to have some measures of suppression in place after the lockdown for increased benefit (as measured by reduction in the number of cases). A longer lockdown from 42–56 days is preferable to substantially ‘flatten the curve’ when compared to 21–28 days of lockdown. Our models focus solely on projecting the number of COVID-19 infections and thus inform policymakers about one aspect of this multifaceted decision-making problem. We conclude with a discussion on the pivotal role of increased testing, reliable and transparent data, proper uncertainty quantification, accurate interpretation of forecasting models, reproducible data science methods, and tools that can enable data-driven policymaking during a pandemic. Our software products are available at covind19.org.
more » « less
Full Text Available

Search for: All records